
[Torch] Remove unnecessary reshapes for batch_matmul #7675

Merged Mar 17, 2021 (5 commits)

Conversation

comaniac (Contributor)
This PR removes unnecessary reshape ops in the PyTorch frontend when converting matmul-style ops to batch_matmul, which should improve the performance of NLP models such as BERT.

cc @siju-samuel @masahi
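For context, Relay's nn.batch_matmul expects its second operand pre-transposed, which is why the converter emits a transpose before the call. A minimal NumPy sketch of that contract (the helper name is illustrative, not TVM API):

```python
import numpy as np

def batch_matmul(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    # Relay's nn.batch_matmul computes out[i] = a[i] @ b[i].T,
    # i.e. b arrives with its last two axes already swapped.
    return np.matmul(a, np.swapaxes(b, -1, -2))

a = np.random.rand(10, 3, 4).astype("float32")
b = np.random.rand(10, 5, 4).astype("float32")  # (batch, N, K), pre-transposed
out = batch_matmul(a, b)  # shape (10, 3, 5)
```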

comaniac (Contributor, Author)
Pushed a new commit that also reorders the reshape_b and transpose so that the SimplifyExpr pass can be applied.

Before this PR:

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(10, 4, 5), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = reshape(%input0, newshape=[-1, 3, 4]) /* ty=Tensor[(10, 3, 4), float32] */;
  %1 = reshape(%input1, newshape=[-1, 4, 5]) /* ty=Tensor[(10, 4, 5), float32] */;
  %2 = transpose(%1, axes=[0, 2, 1]) /* ty=Tensor[(10, 5, 4), float32] */;
  %3 = nn.batch_matmul(%0, %2, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */;
  reshape(%3, newshape=[10, 3, 5]) /* ty=Tensor[(10, 3, 5), float32] */
}

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(4, 5), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = reshape(%input0, newshape=[-1, 3, 4]) /* ty=Tensor[(10, 3, 4), float32] */;
  %1 = reshape(%input1, newshape=[-1, 4, 5]) /* ty=Tensor[(1, 4, 5), float32] */;
  %2 = transpose(%1, axes=[0, 2, 1]) /* ty=Tensor[(1, 5, 4), float32] */;
  %3 = nn.batch_matmul(%0, %2, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */;
  reshape(%3, newshape=[10, 3, 5]) /* ty=Tensor[(10, 3, 5), float32] */
}

fn (%input0: Tensor[(1, 12, 14, 64), float32], %input1: Tensor[(1, 12, 64, 14), float32]) -> Tensor[(1, 12, 14, 14), float32] {
  %0 = reshape(%input0, newshape=[-1, 14, 64]) /* ty=Tensor[(12, 14, 64), float32] */;
  %1 = reshape(%input1, newshape=[-1, 64, 14]) /* ty=Tensor[(12, 64, 14), float32] */;
  %2 = transpose(%1, axes=[0, 2, 1]) /* ty=Tensor[(12, 14, 64), float32] */;
  %3 = nn.batch_matmul(%0, %2, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(12, 14, 14), float32] */;
  reshape(%3, newshape=[1, 12, 14, 14]) /* ty=Tensor[(1, 12, 14, 14), float32] */
}

After this PR:

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(10, 4, 5), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = transpose(%input1, axes=[0, 2, 1]) /* ty=Tensor[(10, 5, 4), float32] */;
  nn.batch_matmul(%input0, %0, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */
}

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(4, 5), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = transpose(%input1, axes=[1, 0]) /* ty=Tensor[(5, 4), float32] */;
  %1 = reshape(%0, newshape=[-1, 5, 4]) /* ty=Tensor[(1, 5, 4), float32] */;
  nn.batch_matmul(%input0, %1, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */
}

fn (%input0: Tensor[(1, 12, 14, 64), float32], %input1: Tensor[(1, 12, 64, 14), float32]) -> Tensor[(1, 12, 14, 14), float32] {
  %0 = reshape(%input0, newshape=[-1, 14, 64]) /* ty=Tensor[(12, 14, 64), float32] */;
  %1 = transpose(%input1, axes=[0, 1, 3, 2]) /* ty=Tensor[(1, 12, 14, 64), float32] */;
  %2 = reshape(%1, newshape=[-1, 14, 64]) /* ty=Tensor[(12, 14, 64), float32] */;
  %3 = nn.batch_matmul(%0, %2, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(12, 14, 14), float32] */;
  reshape(%3, newshape=[1, 12, 14, 14]) /* ty=Tensor[(1, 12, 14, 14), float32] */
}
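The reorder in the third case is semantics-preserving: collapsing the leading batch dimensions and transposing the last two axes touch disjoint axes, so the two operations commute. A quick NumPy check of that claim, using the shapes from the example:

```python
import numpy as np

x = np.random.rand(1, 12, 64, 14).astype("float32")

# Before the reorder: collapse the batch dims, then transpose the last two axes.
before = x.reshape(-1, 64, 14).transpose(0, 2, 1)   # (12, 14, 64)

# After the reorder: transpose the last two axes, then collapse the batch dims.
after = x.transpose(0, 1, 3, 2).reshape(-1, 14, 64)  # (12, 14, 64)

same = np.array_equal(before, after)  # the two orderings agree elementwise
```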

In particular, since the weights in most PyTorch models have to be transposed when converting to Relay, the second case, for example, could be:

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(5, 4), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = transpose(%input1, axes=[1, 0]) /* ty=Tensor[(4, 5), float32] */; <- Not added by matmul
  %1 = transpose(%0, axes=[1, 0]) /* ty=Tensor[(5, 4), float32] */; <- Added by matmul
  %2 = reshape(%1, newshape=[-1, 5, 4]) /* ty=Tensor[(1, 5, 4), float32] */;
  nn.batch_matmul(%input0, %2, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */
}

By applying SimplifyExpr to cancel the unnecessary transposes, we end up with:

fn (%input0: Tensor[(10, 3, 4), float32], %input1: Tensor[(5, 4), float32]) -> Tensor[(10, 3, 5), float32] {
  %0 = reshape(%input1, newshape=[-1, 5, 4]) /* ty=Tensor[(1, 5, 4), float32] */;
  nn.batch_matmul(%input0, %0, meta[relay.attrs.BatchMatmulAttrs][0]) /* ty=Tensor[(10, 3, 5), float32] */
}
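The cancellation is safe because the two transposes are inverses of each other, so only the reshape that adds the unit batch dimension needs to remain. A small NumPy sketch:

```python
import numpy as np

w = np.random.rand(5, 4).astype("float32")

# Two successive transposes over the same axes are the identity,
# which is what lets SimplifyExpr remove the pair.
round_trip = w.transpose(1, 0).transpose(1, 0)

# What survives is just the reshape adding the unit batch dim.
batched = w.reshape(-1, 5, 4)  # shape (1, 5, 4)
```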

@masahi masahi merged commit 4abbe49 into apache:main Mar 17, 2021
masahi (Member) commented Mar 17, 2021

Thanks @comaniac

@comaniac comaniac deleted the pytorch_remove_reshape branch March 17, 2021 16:30
trevor-m pushed a commit to trevor-m/tvm that referenced this pull request May 6, 2021
* [Torch] Remove unnecessary reshapes for batch_matmul

* lint

* fix

* reorder

* lint
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request May 11, 2021